Efficient measurement of quantum dynamics via compressive sensing 
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The resources required to characterise the dynamics of engineered quantum systems — such as quantum com- 
puters and quantum sensors — grow exponentially with system size. Here we adapt techniques from compressive 
sensing to exponentially reduce the experimental configurations required for quantum process tomography. Our 
method is applicable to dynamical processes that are known to be nearly-sparse in a certain basis and it can 
be implemented using only single-body preparations and measurements. We perform efficient, high-fidelity 
estimation of process matrices on an experiment attempting to implement a photonic two-qubit logic-gate. The 
data base is obtained under various decoherence strengths. We find that our technique is both accurate and noise 
robust, thus removing a key roadblock to the development and scaling of quantum technologies. 



Understanding and controlling the world at the nanoscale — 
be it in biological, chemical or physical phenomena — requires 
quantum mechanics. It is therefore essential to character- 
ize and monitor realistic complex quantum systems that in- 
evitably interact with typically uncontrollable environments. 
One of the most general descriptions of open quantum sys- 
tem dynamics is a quantum map — typically represented by 
a process matrix Q. Methods to identify the process ma- 
trix are collectively known as quantum process tomography 
(QPT) QH4). For a <i-dimensional quantum system, they re- 
quire G(d A ) experimental configurations: combinations of in- 
put states, on which the process acts, and a set of output ob- 
servables. For a system of n of the simplest quantum objects, 
namely qubits — two-level quantum systems — d=2 n . The re- 
quired physical resources hence scale exponentially with sys- 
tem size. In principle, a single generalized measurement is 
sufficient for full process tomography in a extended Hilbert 
space relying on highly nonlocal many-body measurements 
that are physically unavailable Q. Recently, a number of al- 
ternative methods have been developed for efficient and se- 
lective estimation of quantum processes (6H3. However, full 
characterization of quantum dynamics of comparably small 
systems, such as a recently demonstrated 8-qubit ion trap (9), 
would still require over a billion experimental configurations, 
clearly a practical impossibility. So far, process tomography 
has therefore been limited by experimental, and — to a lesser 
extent — by off-line computational resources, to systems of 2 
and 3 qubits (10HI3. 

Here we adapt techniques from compressive sensing (CS) 
to develop an experimentally efficient method for QPT. It re- 
quires only 0(s log d) configurations if the process matrix is 
s-compressible in some known basis, i.e., it is nearly sparse 
in that it can be well approximated by an s-sparse process 
matrix. This is usually the case, because engineered quan- 
tum systems aim to implement a unitary process which is 
maximally-sparse in its eigenbasis. In practice, as observed 
in QPT experiments in liquid-state NMR 03HI2L photonics 
(TOlQSlIn), ion traps fl8l . and superconducting circuits ITU , 
a near-unitary process will still be nearly-sparse in this ba- 
sis, and still compressible. The near sparsity emanates from 



decoherence originating in few dominant system-environment 
interactions. This is more apparent for weakly decohering sys- 
tems lfT9ll20l 

We experimentally demonstrate our algorithm by estimat- 
ing the 240 real parameters of the process matrix of a canon- 
ical photonic two-qubit gate, Fig. [T] from a reduced number 
of configurations. For example, from just 18 and 32 config- 
urations, we obtain fidelities of 94% and 97% with process 
matrices obtained from an overcomplete set of all 576 avail- 
able configurations. 

Compressive sensing provides methods for compression of 
information carried by a large- size signal into a significantly 
smaller one along with efficient convex optimization algo- 
rithms to decipher this information I2T1 [22l . Originally de- 
veloped to exploit compressible features of natural audio and 
video signals, applications of compressive sensing have re- 
cently found their way to quantum tomography: Simulations 
of compressive sensing for QPT l23l . application to ghost- 
imaging l24l . and quantum state tomography for low -rank 
density matrices (25). The latter provides a quadratic reduc- 
tion of physical resources compared to standard state tomog- 
raphy, i.e., for a density matrix of rank r, O (rd log 2 d) vs. 
standard d 2 settings, and it also has the main advantage that 
rank is basis independent. 

Under reasonable assumptions, a quantum map on a d- 
dimensional space has the general representation d, 

d 2 

a, (3=1 

where x, the d 2 x d 2 process matrix, is positive semidefinite, 
X>0, and trace preserving, J2 a ,p Xa^ ] ^a=h, with {T a } 
an orthonormal matrix basis set, Tr (T^T a ) =5 ' a p. Note that 
sparsity is a property of the map representation not the map 
itself. Data is collected by preparing an ensemble of identi- 
cal systems in one of the states {pi, pk}, inputting them 
to the process x, and then measuring an observable M from 
the set {Mi, ...,M^}. For a pair (p, M), the outcome will 
be yM, P =r ^{S{p)M). If the experiment is repeated for all 



configurations, i.e., (pi, Mi), i=l, . . . , m=k£, the relation be- 
tween the vector of outcomes y=[yM lyPl , • • • , yM m , Prn ] T and 
the true process matrix, denoted by xo> can be represented by 
a linear map y=&xo> where xo is the vectorized form of the 
process matrix xo and ^isanmxd 4 matrix of coefficients of 
the form Tr(T a piT^Mi)/y/m. 

In general, estimating a sparse process matrix with an un- 
known sparsity pattern from an underdetermined set of linear 
equations (m<d A ) would seem highly unlikely. Compressive 
sensing, however, tells us that this can be done by solving for 
X from the convex optimization problem: 



minimize ||x||^ subject to \\y - ®x\\i 



(2) 



and positive- semidefinite and trace-preserving conditions as 
defined above. The parameter e quantifies the level of uncer- 
tainty in the measurements, that is, we observe y=$xo+ w 
with \\w\\ i2 < e. From [22 ESI, recovery via ^ is ensured if 
(i) the matrix <£> satisfies the restricted isometry property: 



1 - 6 a < 



\\$Xi(s)-®X2(s)\\t 
IIXiO) -X2( s )fe 2 



< 1 



(3) 



for all s-sparse Xi( 5 )? X2(s) process matrices; (ii) the isome- 
try constant 62s < V% — 1 and (iii) the number of configurations 
m > Cos\og(d 4: /s). Under these conditions, the solution x* 
of ([2]) satisfies, 



Xo| 



■C 2 e 



(4) 



where Xo( s ) is the best s-sparse approximation of xo and 
Co , Ci,C2 are constants on the order of 0(5 S ), see Ap- 
pendix [B] The restricted isometry property states that two s- 
sparse process matrices Xi( s ) and X2(s) can be distinguished 
if their relative distance is nearly preserved after the mea- 
surements, i.e., under transformation by <£. If the measure- 
ments are noise free, e=0, and the process matrix i s ac tu- 
ally s-sparse, Xo=Xo(s), then the right hand side of (C2) is 
zero leading to perfect recovery, x*=Xo- Otherwise the so- 
lution tends to the best s-sparse approximation of the pro- 
cess matrix plus the additional term due to measurement er- 
ror e. If for an n-qubit QPT with d=2 n the conditions of 
the above analysis are satisfied, then the number of experi- 
mental configurations m scales linearly with sn, specifically, 
m>Co<s(4n log 2— log s)=0(sn). In the appendix, using the 
measure concentration properties of random matrices, follow- 
ing the arguments in l22l l26ll , we show that if <l> is con- 
structed from random input states {pi}, and random observ- 
ables {Mi}, then the restricted isometry in ^ holds with high 
probability. Also a test is presented to certify the sparsity as- 
sumption. 

A nearly-sparse process matrix can thus be encoded into 
an exponentially smaller number of measurement outcomes, 
which can be recovered to within the bounds of ( |C2[ ) by 
solving ([2]). We now test our algorithm experimentally 
against standard QPT on a two-qubit gate under a range 
of decoherence — and thus sparsity — conditions. We used 
a photonic controlled-phase, CZ, gate, Fig. [T] where the 




State Preparation 

Iqwp |hwp 



^PBS 



|APD 



FIG. 1: Experimental scheme. Two-photon inputs were prepared 
with either (a) a high-rate, non-scalable, two-photon source or (b) a 
low-rate, scalable, four-photon source. The qubits are encoded us- 
ing polarisation, as described in the text. The quantum process is a 
photonic entangling-gate. A measurement configuration is defined 
as some combination of state preparation and tomography, imple- 
mented here with quarter- and half- waveplates (QWP, HWP), polar- 
izers (PBS), and photon detectors (APD). For details see Appendix 
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FIG. 2: Process fidelities vs. number of input-output configurations, 
for each compressive QPT estimate, % m , in the gate-basis of the ideal 
CZ-gate for the lowest measured noise level, 7^=0.91. The dashed 
line shows the fidelity of the full estimate ^(Ucz, X576)=0.89 (black 
diamond). Error bars are obtained by solving (|2) for 50 different 
random combinations of m inputs and observables. 



qubits are encoded in orthogonal polarization states of single 
photons (\H), horizontal, and \V), vertical). We performed 
full process tomography (TO) [T6J Q2) of the gate with both 
2-photon and 4-photon arrangements, preparing 16 pair- wise 
combinations of the 4 input states {\H), \V), \D), \R)} 
and, for each input, measuring 36 two-qubit combina- 
tions of the observables {\H),\V),\D),\A),\R),\L)}, 
where \D) = (\H) + \V)) /y/2, \A) = (\H)-\V))/y/2, 
\R)=(\H)+i\V)/V2, and \L)=(\H)-i\V) /y/2. These 
576 input-output configurations represent an overcomplete 
set which allows the best possible estimate of the quantum 
process, denoted X576 iflQl . 

The compressed quantum process tomography (CQPT) es- 
timate of the 16x16 process matrix, denoted Xm, is ob- 
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FIG. 3: Real and imaginary process matrix elements in the Pauli basis for the CQPT estimate X32, 32 configurations (left) vs. full data estimate 
X576 , 576 configurations (right) for A) a low noise, 2-photon experiment, V=0.91, and B) a high-noise, 4-photon experiment, V=0.62. The 
CQPT reconstructions have fidelities, J-"(x576, X32), of 95% and 85% respectively. The CQPT estimation accuracy is excellent for low noise, 
and reliable even for high noise, see Appendix |G| for more details. 



tained by solving ^ with y=C se \p and $>=C se \G where p 
is the 576 x 1 experimental probabilities corresponding to 
each of the 576 configurations, G is the 576x256 matrix 
obtained from all the configurations and the basis set [T a 
in ([I])], and C se i is the mx576 matrix corresponding to tak- 
ing a selection of m < 576 of all possible configurations. 
The basis set is obtained from the singular- value decomposi- 
tion of the ideal CZ-gate: the process matrix in this basis is 
maximally sparse with a single non-zero 1,1 -element which 
equals 4. The measurement error bound s in ([2]) is chosen 
to be just slightly larger than y/ma, where a is the mini- 
mum feasible root-mean-square level obtained from ^ us- 
ing all configurations, i.e., with C se i=i576- We quantify de- 
coherence using the process purity, V=Tr(x^ n /d 2 ), which 
varies from for a completely decohering channel, to 1 for 
a unitary process: in our experiment we used six decoher- 
ence levels (see Appendix [G] for details), giving purities of 
{0.62, 0.74, 0.77, 0.79, 0.86, 0.91} ± 0.01. 

Figure [2] shows, for the lowest decoherence level, the pro- 
cess fidelities |[T0li versus the number of randomly-selected 
configurations, m. Each process matrix, {x™}> is obtained by 
solving ([2]). We use the fidelity between (i) the compressive 
measurement and the ideal, .F(?7cz> X™); and (ii) the com- 
pressed and optimal measurements, J-XX576? Xm)- Note that 
as m increases the fidelity with the ideal converges to the value 
of 0.89 obtained from X576; likewise, the fidelity with the full 
estimate converges to unity. Similar plots exist for every level 
of decoherence, with fidelities reduced accordingly. 

We have so far used random selections of probabilities from 
the full data set, which allows us a comprehensive test of com- 
pressive sensing theory. Experiments, however, don't yield 
probabilities but physical quantities, e.g. count rates. To date, 
algorithms for more efficient state 1 25 ] or process tomography 
have assumed probabilities as a starting point. Since normal- 
ization is an issue to some extent in all physical architectures, 
it will be necessary to investigate the robustness and scalabil- 



ity of algorithms for real- world experiments. 

For our photonic two-qubit gate, which is lossy and intrin- 
sically probabilistic, the probabilities were obtained by nor- 
malising counts using a full basis set of observables extracted 
from all measurements, I^q. Having sufficient configura- 
tions to allow for normalisation necessarily imposes limits on 
CQPT efficiency: for low m, we are restricted in how random 
our selections can be. (Details and some permissible configu- 
rations in Appendix [E]). As an example, Fig. [3] shows process 
matrices reconstructed via CQPT from just one of these con- 
figurations compared to the respective full data estimates. We 
used 32 combinations of the 16 inputs {\H), \R)} 
and 2 observables {\R) 1 1) \R)}, where I is the identity. 
The agreement is excellent as one can see from the fidelities 
and the correct reproduction of imaginary elements — which 
are ideally zero. Another striking feature is that we obtain 
highly faithful reconstructions of a non-local process using 
only local measurements 0. 

A further crucial test is whether CQPT enables us to locate 
errors and implement necessary corrections: a common exam- 
ple is identifying local rotations that move the process closer 
to the ideal. By optimising T{TJ C7j , X32), we calculated local 
corrections to X32; applying them to the full estimate X576, 
^(Ucz, X576) improved, on average, over all decoherence lev- 
els, by 4.1%. This is very close to the average 4.9% improve- 
ment obtained by calculating and applying local corrections 
directly to X576- Even a low-configuration CQPT estimate of 
a noisy process therefore enables improvements. 

That high-fidelity estimates are obtained by CQPT can be 
understood from the error bound ( |C2| ) which shows that the 
CQPT estimate tends towards the best s-sparse approximation 
of the true process, in this case our best estimate X576- Fig. [4] 
shows the process matrix elements, sorted by relative mag- 
nitude, for low and high noise levels, in two basis sets. The 



s-sparse approximation levels indicated in (C2) are reached 
where the matrix elements drop below the error threshold 
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FIG. 4: Absolute values of the 256 process matrix elements of X576 
for our lowest and highest noise level, sorted by relative magnitude 
(with respect to the 1,1 -element) in the CZ basis (top) and the Pauli 
basis (bottom). The error threshold, which indicates the required 
number of configurations, is shown in grey. 



(0.01-0.02). For the corresponding m, we can therefore ex- 
pect a successful, high-fidelity, CQPT reconstruction. In the 
CZ-basis, the plots show that for low noise, sG[20, 30], which 
correlates well with the fidelities in Fig. [2j for high noise 
se[40,60]. Although the process matrix is still somewhat 
sparse in the Pauli-basis (Fig. [3]), the corresponding plots in 
Fig. [4] indicate that ~100 configurations are needed to obtain 
an estimate of comparable quality. Furthermore, the sorted 
magnitude values in the CZ basis decay exponentially, which 
is sufficient to declare the process matrix s-compressible, e.g., 
|27]|28]|. Intriguingly, this exponential decay is a signature 
of model-based compressive sensing where the scaling goes 
from m=G(s log(d/s)) to m=0(s) |28|. This demands fur- 
ther investigation, since it appears that QPT fits this frame- 
work, particularly when the process matrix is expanded in the 
ideal basis corresponding to the unitary design goal. 

Our experimental results are supported numerically by sim- 
ulations of a 2-qubit process, see Appendix [HJ and of 3-, and 
4-qubit processes, see Appendix [I] 

Applying CQPT to larger systems will require careful atten- 
tion to classical post-processing which — as in QPT — scales 
exponentially. The standard software we used here (see Ap- 
pendix[F]), can easily handle 2 and 3 qubit CQPT systems. For 
larger systems, more specialized software can increase speed 
by orders of magnitude, e.g., l27l . 

A number of research directions arise from this work: in- 
corporating knowledge of model structure properties; tight- 
ening the bounds on scaling laws; understanding how near- 
spar sity s and rank r vary with system dimension, d\ pur- 
suing highly efficient convex-computational algorithms; and 
selection of optimal configurations. Compressive tomogra- 
phy techniques can also be applied to quantum metrology 
and Hamiltonian parameter estimation: for example, estimat- 
ing selective properties of biological or chemical interest in 
molecular systems and nanostructures with typically sparse 



Hamiltonians 1291. 



Appendix A: Norms 

Definitions of the norms used throughout the paper. For a 
vector x G C n , 



INI*! = EIUN- 



,.|2 



(Al) 



For a matrix A G C mxn with rank(A) = r < min{m, n} 
and singular values g\ > a 2 > • • • > o r > 0, 



Induced £2 norm \\A\ 
Frobenius norm \\A\ 
Nuclear norm \\A\ 



2 

fro 



SUpiun 



=1 \\AX\\ 



Frobenius norm \\A\\^ = y/Tr(A^A) = VELi ^ 



Tr(x/ZU) = £[^ 



(A2) 



Appendix B: Restricted isometry property from a concentration 
inequality 

A common approach to establish the restricted isometry 
property (RIP), Eqn.(3) in the paper, for a matrix A G C mxn 
with m < n is by introducing randomness in the elements 
of this matrix. This approach benefits from measure concen- 
tration properties of random matrices. For QPT for the mea- 
surment matrix <l> G C m in Eqn.(2) of the paper, we show 
how to achieve this with random preparation of the intial states 
and a random selection of the measurement operators. The 
proof is based on the results in l30l which show that if is a 
random matrix which satisfies the concentration property, 



Pr 



{lll^lll-||^llll>^lklll}<2e- 



2mC 3 (5 s ) 



(Bl) 



for all x G C^ 4 , where S s G (0, 1) and Cs(6 s ) only depends 
on (L, then $ satisfies the RIP, 



(l-5 a )\\x a f t <||^||J <(1 + S a )\\x a \\l 



(B2) 



for all s-sparse x s G C d . This version of RIP is equivalent 

to(g. 

In classical signal processing, each element of the $ ma- 
trix can be independently selected from a random distribution 
such as Gaussian or Bernoulli. For QPT there is no freedom 
for random independent selection of every element of the $ 
matrix. However, as described in the paper, the rows of $ 
can be independently and randomly selected. To see this, re- 
call that for each experimental configuration we can initial- 
ize the system randomly in a state p G {pi G C dxd }^ =1 
and then measure an observable M randomly chosen from 
{Mj G C dxd } £ j =1 . The corresponding matrix $ then has 

m = hi independent random rows G C lxAr }^ 1 with 
correlated elements of each row since they are functions of 
the same M and p. Observe, however, that although $ is a 
random matrix, because it is constructed from quantum states 
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and observables of a finite dimensional system, it is bounded. 

,4 

As a consequence, Mx G C , we get, 



(w*/ra) < ^(^.^t^ < (w u /m) \\x\\l 2 

i\\x\\l < K\\M\l < u\\xf i9 



(B3) 



where E denotes expectation with respect to $ and 
w u ,W£,u,£ are constants. Next we apply, 

Hoeffding's concentration inequality Let vi,...,v m be 
independent bounded random variables such that Vi falls in 
the interval [a$, bj\ with probability one. Then for S = V{ 
and any t > we have, 



Pr{S-E(S)>£} < e-^/E,^-^) 2 
Pr{S-E(S) < -t} < e - 2 * 2 /Ei(6i-ai) 2 



(B4) 



In our problem vi = | </> j" x | 2 and S = \ \ &x \ \ \ . From the above 



inequalities and the relations in ( |B3[ ) we find Vt+, t- > and 
Vz, 

Pr{^-^||x||, 2 2 >t + } < Pr{S-E(S)>U} 

< e -2^/( Wu - W ,) 2 

Pr{S-Z||a;||* 2 < -£_} < Pr{S-E(S) < -*_} 



< -2ti/(w u -w £ y 



(B5) 



The choice of t+ = (5 a + 1 — iz) ||x||^ and t- = (£ — 1 



5 S ) ||x||^ 2 in the above inequalities yields 

Pr {| ||$x|| £ 2 2 - HicHfj. | > 6s \\x\\l} < 2e- 2 "^+< 



Pr 



{| \\$y\\l - 1| > 6 s /2] < 2(12/*.)- 



-2m(<5 s /2 + e)^ 



or equivalently 1 - S 3 /2 < \\$y\\} 2 < 1 + 5 s /2 holds with 
probability exceeding 

P = 1 - 2(12/(5 S ) S exp(-2m(J a /2 + e) 2 /K - w £ ) 2 ). 

Define z to be the smallest number such that || $x' < 1 + z 
for all x' with \\x'\\ £ = 1. For a vector y G F we have, 

||$x'||, 2 < ||$y|| /a + ||d>(z' - y)\\ t2 < 1 + | + (1 + z) S -± 

from which it follows that z < S s , for any < 5 3 < 1. 
In a similar fashion we can prove 1 — S s < \\^x f \\ £ . This 



completes the proof that RIP ^ holds with probability ex- 
ceeding P for all x G X s . The number of sets X s with 
#X S = 5 is ( N a ) < (eN/s) s . Therefore RIP fails to be sat- 
isfied with probability 2 exp(— 2m(S s /2 + e) 2 / (w u — wi) 2 + 
s[log(eN/s) + log (12 / 5 s )]). For a sufficiently small constant 
Co, if Co 5 < m/log(N/s), we can find a constant < C3 
such that the probability of a failure of RIP becomes smaller 
than exp(— C^m) provided that C3 < 2m(S s /2 + e) 2 / (w u — 
vug) 2 — s[log(eN/s) + log (12/5 s )]. This guaranteed exponen- 
tially small chance of RIP failure is the key to the logarithmic 
scaling of the resources in CQPT. If RIP is satisfied the l\ 
norm minimization algorithm works to find a sparse solution. 
Here we proved that by increasing the number of configura- 
tions m would exponentially decrease the chance of RIP fail- 
ure. This completes the connection between the concentration 
measure (|B6|) and the restricted isometry property. 



y/(w u -w e y 
(B6) 

with e = min{l — u, £ — 1}. We also need t+ and t_ to be 
positive that imposes the condition 1 — 5 S < £ < u < 1 + 5 S . 
Since the obervable M can be scaled by any real factor, a 
sufficient condition isu/£ < + — 5 3 ). 

Next we reproduce the connection between the measure 
concentration ( |B6| ) and restricted isometry as demonstrated in 
(30): Let X s be a set of vectors with cardinality s: #(X S ) = 
s. We choose a set Y C X s such that \\y\\^ 2 = 1 for all y G Y, 
we have min^y \\x — y\\^ < 5 S /A for all x G X s . The car- 
dinality of such a set F can al ways be chosen to be smaller 
than (12/S S ) S [31 1. There from @ we find 



Appendix C: Performance of the algorithm 

In Ref. (26lL the accuracy of the ^i-norm minimization prob- 
lem is given by ( |C2[ ). The parameters C\ and C2 are explicitly 
given in terms of the isometry constant 5 S : 



n 2 + (2x/2-2)^ s 

^1 = —7^ N c 7 ^2 



l-(x/2 + l)^ s 



1-(a/2 + 1)^ 



(CI) 



To present all the distances based on /i-norm we can use 
\\y\\h < lbll/ 2 < V^lbHii, for a D-dimensional vector y 
and obtain the algorithm performance as 

II** - XolU < ^ ||xo(s) - Xo|| £l +d 2 C 2 e (C2) 

However the performance inequality presented in the paper 
has a tighter bound. 



Appendix D: Sparsity assumption certification 



A test to certify the sparsity assumption can be concluded 



from ( |C2| ) and the probability of RIP being satisfied exceed- 
ing 1 — e - mC 3(5 s ) f or m configurations. Suppose an esti- 
mate Xm is obtained for m configurations. If the measure 
||Xra+i — Xmll^s which quantifies an incremental improve- 
ment in the estimated process matrix, converges toward zero 
for a polynomially large m, the sparsity assumption is certi- 
fied. 



Appendix E: Normalization and Precision Issues 

In the formulation of CQPT a random selection of the expec- 
tation values yMi, Pi are not available in our experiment. Due 
to photon loss the detector counts are not conclusive, hence, a 
complete set of counts corresponding to a complete set of ob- 
servables is required to produce meaningful expectation val- 
ues yMi, Pi • A solution to this problem is to limit the measure- 
ments to few-body observables. For /c-body measurements 
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Inputs 


Observables 


m 


^(X576,Xm) 




HVDR x2 


HVDARL x2 


576 


1 


0.88 


HVDR x2 


{RIJR} 


32 


0.98 


0.89 




{DIJD} 




0.97 


0.87 


HVDR X2 


RL x2 


64 


0.95 


0.86 




DAxDA 




0.95 


0.86 


VDR x2 


{RIJR} 


18 


0.94 


0.86 




{DIJD} 




0.93 


0.88 


VDR x2 


RL x2 


36 


0.94 


0.87 




DA x2 




0.94 


0.84 



TABLE I: Fidelity assessment of some selected configurations that 
are available in our experiment. 

a total number of 2 k complementary observables need to be 
measured. Since m, the number of measurements, is expo- 
nentially small we can choose k limited to few-body opera- 
tors, k = km ax , and even single-body as we did in the exper- 
iment. For a fully random selection of observables, the total 
number of measurements m will be increased by a constant 
factor 2^ aar Still this number is exponentially small. This re- 
dundancy, however, can be avoided by using the outcomes of 
all 2 k observables. This selection scheme is not fully random, 
rather it is a deterministic-random way of choosing observ- 
ables. 

As discussed in the paper, random selections of probabili- 
ties from the full data set, although exhibiting results which 
are entirely consistent with compressive sensing theory, are 
inconsistent with how data is actually collected in this kind 
of standard photonic experiment. In practice we are limited 
to measure few-body observables. For low m, the configura- 
tions must allow for normalisation, i.e. we are restricted in 
how random our low-number selections can be. A selection 
of some of these permissible configurations are shown in Ta- 
ble [TJ Here we see some of the remarkable results promised by 
the theory of compressed sensing, e.g., a 98% fidelity from 32 
configurations and a 94% fidelity from only 18 configurations. 

Another issue to consider is experimental precision. The 
expectation values of /c-body observables of random states re- 
duce for a larger k. This implies the need for a larger number 
of statistical samples. Fortunately, this issue is not a problem 
for our scheme since we can take k as small as we want, as 
discussed above. 



Appendix F: Classical postprocessing 

The estimation results computed from the experimental data 
were all obtained by solving equation 2 in the main text by 
using "off-the-shelf" MATLAB based software. Specifically, 
we used YALMIP to call the convex solver SDPT3 l32ll33l . 
On a standard desktop it takes about 2 sec of CPU-time to 
solve (2) for the full 576 configuration set. This software can 
handle 3-qubit systems but it is more advisable to migrate to 
more specialized software where orders of magnitude speed 
increases are possible, e.g., l27l . 



loss 




^beam-splitter (reflectance n=1/3) PBS 



FIG. 5: Detailed representation of the CZ-gate in dual rail encoding. 
Each qubit is represented by two paths, one for each logical basis 
state, |0> \H) and |1) \V) OiU36l . 



Appendix G: Experimental Details 

The quantum gate used in the experiment is a photonic 
controlled-phase gate, Fig. [5] I34TI361. It is based on a sin- 
gle partially polarising beam splitter (PPBS), having differ- 
ent reflectivity, Vv = ^ Vh=0, for the horizontal and the ver- 
tical polarisation of input photons. Due to two-photon in- 
terference, the input state \VV) undergoes a tt phase shift 
\VV) —> — \VV) whenever the two photons leave the PPBS 
through different output ports. Correct operation of the gate 
is signalled by a coincidence detection in these output modes; 
the gate is thus probabilistic, with a success probability of 1/9. 

The gate acts on photonic qubits created via spontaneous 
parametric downconversion (SPDC). Downconversion is in- 
trinsically a random process: consequently the created states 
contain small amounts of higher-order emission — e.g. |22) 
as well as the desired |11) — which appear as decoherence in 
a quantum process (37l [38) . The ratio of higher order terms 
to the desired photon pair number increases with the pair cre- 
ation probability, which in turn is proportional to the pump 
laser power. Once can therefore — to some extent — control the 
decoherence in a process via the laser power. 

In order to cover a comprehensive range of decoherence, we 
performed six experiments with 2-photon states directly cre- 
ated via a single SPDC emission, and one experiment with 
4-photon states created in two independent SPDC sources, 
where one photon of each SPDC process was used as a trigger. 
The latter experiment is more representative of large-scale 
systems, where independent photon sources will be required. 
It has significantly reduced count rates, and reduced two- 
photon interference between photons in the quantum gate due 
to both the pump-induced decoherence and group-velocity 
mismatch [37], reflected in the low purity of the process in 
this case of 0.62. 

Typical count rates for 2-photon experiments are 2000 co- 
incident counts per second, full QPT, building up reasonable 
statistics, takes about 2.5 hours; in contrast, 4-photon experi- 
ments have much lower rates, 1 four- fold coincidence per sec- 
ond, and take 2 days. The 32-configuration CQPT reduces 
tomography times to 8 minutes and 2.6 hours respectively: a 
clear advantage. 

Fig. [6] shows the effect of varying laser pump power on 
CQPT estimation accuracy for one of the single-observable 



7 



0.95 



0.9 



v 0.85 



0.8 



0.75 



In = {\H),\V),\D),\R)}x{\H),\V),\D),\R)} 
Out = \RI) x \RI) 



O 
□ 



□ ^(t/ C z,X576) 
O ^(f/cz,X32) 
A -F(X576,X32) 



0.7 
0. 



0.65 



0.7 0.75 

Purity (x 5 - 



0.8 



0.85 



0.9 



FIG. 6: Fidelities vs. purities for m = 32 corresponding to the 
configurations in Table [I] 



configurations from Table |T| Specifically for the 32 configura- 
tions arising from all combinations of the 16 inputs HVDRx- 
HVDR and 2 outcomes {RI,IR}. As pump power increases, 
the process purity, as measured by Tr(xl 76 )/16 decreases; 
effectively the signal to noise ratio deteriorates. As might be 
expected, the worst-case fidelity decreases with process pu- 
rity. The estimated channel fidelity is however remarkably 
robust, staying very close to the actual channel fidelity. 



Appendix H: Simulation results 

QPT is performed by solving ^ with noise-free experi- 
ments (e = 0) for a system designed to be a 2-qubit quantum 
Fourier transform (QFT) with unitary representation f7 q ft G 
C 4x4 , which interacts with an unknown environment via the 
total constant Hamiltonian, H=I e ® H q f t +^H with \\H ||=1; 
7 is thus the interaction magnitude. The simulated system 
Xsim G C 16x16 is extracted via the partial trace over the en- 
vironment for 7 G {0.5, 1.0, 1.25}. Each of these induces 
a fidelity with respect to the ideal unitary, F(Uqtt, Xsim) G 
{0.70, 0.80, 0.95} The estimates from ^ are obtained in the 
singular value decomposition (SVD)-basis [T a in ([I])] of the 
ideal QFT. The process matrix of the ideal unitary in this ba- 
sis is maximally sparse with the single non-zero 1,1 -element 
equal to n = 4 [ 23 ]. The environmental interactions make the 
process matrix almost sparse as defined in ([2]). 

To form the measurement matrix <l> G C mx256 , we ran- 
domly generated 4 and 16 input pairs (g) l^) and 2, 4, 
and 6 random selections from the single-body Pauli observ- 
ables {IX, IY, IZ, XI, YI, ZI}. This gives 6 configurations 
with m G {8, 16, 24, 32, 64, 96}, for which ujt « 1.3 ensur- 
ing 5 « 0.13. Fig.[7]shows the fidelities ^(Xm^Xsim) of the 
reconstructed estimates \m and the simulated process matri- 
ces Xsim for all 18 combinations of m and interaction magni- 
tudes 7. 

These results arise from the relative sparsity of the process 
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FIG. 7: Fidelities vs. configurations for each process matrix esti- 
mate Xm from ([2]) in the SVD basis of the ideal QFT unitary. Black 
bars: simulated compared to ideal process J-*(£/qft, Xsim)- Gray bars: 
estimate compared to ideal .F(£/qft, Xm). White bars: estimate com- 
pared to simulated process J(xm, Xsim). 



matrix in the SVD-basis of the ideal QFT. Fig.[8]shows 3D bar 
plots of the real and imaginary elements of the true and esti- 
mated process matrices for m = 64, ^(C/qft, Xsim) = 0.70, 
and J r (x64, Xsim) = 0.93. In the SVD-basis (row 2), the true 
process matrix exhibits the expected large 1,1 -element with 
the remaining elements much smaller by comparison. The es- 
timated channel fidelity is 0.71. 

In Fig. [7] 7(xm, Xsim) (white bars) trends to increase with 
m, more so for T = 0.7 than for T = 0.95, and rises a bit 
sharply at different m values. Just as for the experimental 
results, this can be connected to the actual sparsity of the sim- 
ulated process matrices. Figure [9| just like Fig. |4]in the main 
text, shows the absolute sorted process matrix elements rela- 
tive to the 1,1 -element. Where each plot crosses the threshold 
of 0.02, we see that the number of elements above this value 
increases with decreasing decoherence 7. If these are taken 
as the s-sparse approximation levels indicated in the theory 
([2]), then (approximately) s G {30, 50, 100} correspond to 
^(^ q ft,Xsim) e {0.95,0.80.0.70}. This correlates well with 
how j(x m , Xsim) varies with resources m. 



Appendix I: Beyond 2-qubits 

Standard QPT scales exponentially, thus for 3 and 4 qubits 
the number of required experimental configurations is, respec- 
tively 4,032 and 65,280. As we have shown theoretically, ex- 
perimentaly, and lastly via the previous simulations, CQPT 



shows quite a different scaling. Fig. [10| shows the absolute 
values, sorted by relative magnitude, of the process matrices 
arising from a random selection of a perturbed system near 
identity, i.e., a quantum memory, corresponding to similar fi- 
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delities. The process matrices elements are shown in a ba- 
sis corresponding to the ideal identity. Again taking 0.01 as 
a threshold we see that for 2-qubits we get m w 20 which 
is similar to our experimental results and those supported by 
the plots in Figures [4] and [9] Fig. [TO] predicts for 3-qubits 
m « 100, and for 4-qubits m « 300. These simulation results 
show first that the process matrices are compressible, and in 
addition are consistent with the experimental results in Fig. [4] 
To actually perform the estimaton, that is solve ([2]), as pre- 
viously mentioned, requires specialized compressed sensing 
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FIG. 8: Real and imaginary x elements for ra=64, ^(C/qft, Xsim) = 
0.71, 7=1.25. Row 1: True process matrix in the natural ba- 
sis. Row 2: True process matrix in SVD-basis of ideal unitary. 
Row 3: Estimated process matrix projected to the natural basis , 

•F(t/qft,Xm)=0.71. 
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FIG. 9: Absolute values of the 256 process matrix elements of 
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sorted by relative magnitude (with respect to the 



11 -element) for J^qft, <S true ) e {0.95,0.80,0.70}. 
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FIG. 10: Absolute values of the process matrix elements sorted by 
relative magnitude (with respect to the 11 -element) all in the ideal 
SVD basis (in this case for an identity operator on the system) for 
three cases: blue Xsim G C 16x16 with T(h, Xsim) = 0.84; red 
Xsim e C 64x64 with F(Is, Xsim) = 0.83; green X sim G C 256x256 
with T(Ii6, Xsim) = 0.85; 



algorithms optimized for speed and efficiency, e.g., (27). 
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